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In this paper, the problem of finding optimal success probabilities of static linear optics quantum gates is 
linked to the theory of convex optimization. It is shown that by exploiting this link, upper bounds for the 
success probability of networks realizing single-mode gates can be derived, which hold in generality for linear 
optical networks followed by postselection, i.e., for networks of arbitrary size, any number of auxiliary modes, 
and arbitrary photon numbers. As a corollary, the previously formulated conjecture is proven that the optimal 
success probability of a postselected non-linear sign shift without feed-forward is 1/4, a gate playing the central 
role in the scheme of Knill-Laflamme-Milbum for quantum computation with linear optics. The concept of 
Lagrange duality is shown to be applicable to provide rigorous proofs for such bounds for elementary gates, 
although the original problem is a difficult non-convex problem in infinitely many objective variables. The 
versatility of this approach to identify other optimal linear optical schemes is demonstrated. 
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Optical implementations of quantum information process- 
ing devices offer many advantages over implementations em- 
ploying other physical systems. Photons are relatively prone 
to decoherence, and precise state control is possible with the 
help of linear optical elements fl^. Moreover, although the re- 
quired non-linearities to do universal quantum computation 
are presently not available at the single-photon level, they 
can be effectively realized by means of measurements. This 
comes at the price of the scheme becoming probabilistic. It 
was one of the key insights in the field, proposed by Knill, 
Laflamme, and Milburn that quantum computation can be 
achieved in a near-deterministic way using only single_photon 
sources, linear optical elements and photon counters [2,'3',0]. 
For this to be possible a significant overhead in resources is 
required Q|5|]. At the basis of the construction of the origi- 
nal scheme, however, is a gate that is implemented with some 
probability of success, the non-linear sign shift gate i^lHI^. 
The best known success probability of this gate using static 
linear optics followed by postselection is one quarter; this can 
then be uplifted to close to unity using teleportation steps. 

One of the central questions seems therefore: how well can 
the elementary gates be performed with static linear optics 
networks? In particular, what are the upper bounds for suc- 
cess probabilities of energy-preserving gates of single-modes? 
This seems a key question for two reasons: on the one hand, 
the success probability at the level of elementary gates is a 
quantity that determines the necessary and notably large over- 
head to achieve near-deterministic scalable quantum compu- 
tation j301- On the other hand, for small-scale applications 
such as quantum repeaters for the long-range distribution of 
entanglement, high fidelity of the quantum gates may often be 
the demanding requirement of salient interest. The achievable 
rates in entanglement distillation, say, may be of secondary 
importance compared to the very functioning of the scheme. 
In such contexts, one should be expected to be well advised to 
abandon some of the feed-forward using quantum memories 
or delay lines, but rather postselect the outcomes. 

The best known scheme to realize the non-linear sign shift 
gate with linear optics without feed-forward succeeds with a 



probability of a quarter. Later Knill showed that the success 
probability can at most reach one half [3]. This was an impor- 
tant step: it was not clear, yet, whether this bound was indeed 
tight. Aiming at tightening this bound, Scheel und Liitkenhaus 
made a further significant step, emphasising that a linear op- 
tics network realizing a quantum gate can be thought of as one 
which is linked once to the input mode by a single beam split- 
ter Jsl], based on a result by Reck, Zeilinger and co-workers 
llgl flOlfTlll . It was conjectured, based on a numerical analysis 
in a restricted setting, that the maximal success probability of 
this gate could be one quarter. 

It is the aim of this paper to link the question of success 
probabilities to the theory of convex optimization lfl2l[T3lll4l 
\ly] . It turns out that convex optimization provides powerful 
analytical methods to prove the validity of bounds to optimal 
success probabilities, without having to resort to restrictions 
of generality. By doing that, we arrive at rigorous tight upper 
bounds for quantum gates in the framework of linear optics 
quantum computation with no feed-forward on the level of el- 
ementary gates. In particular, it is proven that the non-linear 
sign shift gate can in fact be optimally realized with a suc- 
cess probability of exactly 1/4. Non-linear phase gates and 
equivalents in higher Fock layers are also considered. These 
methods will turn out to provide helpful tools, although the 
original problem has infinite dimension and is, to start with, 
non-convex. The central difficulty here in the problem is that 
one cannot bound the size of the auxiliary network a priori: It 
may well be that large networks go in hand with a significant 
advantage fl6ll . 

Let us start by stating the considered setting: we aim at 
formulating a general recipe to find upper bounds for success 
probabilities of gates of single modes preserving the energy 
using (i) photon sources, (ii) photon counters follows by post- 
selection, and (iii) static linear optical networks of any size, 
using an arbitrary number of auxiliary modes and photons and 
an arbitrary number of network elements, but without feed- 
forward on the level of individual gates (in which case the 
unit probability as only upper bound is already known from 
the original |2] and alternative schemes j^l). We will consider 
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quantum gates of the form 

|V'in>= 2;o|0>+2;i|l> + ... + yAr|A^> 



(1) 



where \n) denote the state vectors of number states and 
...,<j>N S To set the phase (f)o = merely corresponds 
to a change of the global phase and does not restrict gener- 
ality. This includes the important example of the non-linear 
sign shift gate, acting as 

iV'in) - 2;o|0)+2;i|l>+y2|2)^ (yo|0)+2;i|l)-2;2|2». 

In a static linear optical realization of the quantum gates, the 
gate can only be realized with a non-unity success probabil- 
ity. Any network constisting of linear optical elements can be 
decomposed into three steps, as has been pointed out in Ref. 
ll^ based on Ref. 1^ : (i) a preparation of a distinguished 
auxiliary mode 2 and all (unboundedly many) other auxiliary 
modes jointly labeled 3 in some initial pure state, (ii) A uni- 
tary operation of the input on 1 and 2, reflecting an applica- 
tion of a central beam splitter with transmittivity t E [—1,1] 
(a convenient convention) and phase Lp S [0, 27r). (iii) A mea- 
surement of all modes labeled 2 and 3, associated with a state 
vector 1 77). As a consequence, any optimal static linear optical 
network of a single input mode is reflected as a map 

Pm^xUpinU^ = ('71(^1.2 «) l3)(Pin «> \uj) (uj\) {Vl^ ® Ig)^), 

for all input states pi^ ~ |V'in)(^'in| of the input mode labeled 
with 1, Vi,2 is the unitary of the central beam splitter charac- 
terized by a real transmittivity t and phase (p. Writing 



k=0 



Xk+i\k) (g> \ujk) 



with real numbers xi, Xn+i, we have to require that 



E 

fe=0 



Xk+lfi^''sk+l 



for all j = 0, N, with e^^+i = (vlk) \(^k) and 
(1) 

where the real g^: , introduced for convenience of notation, 
depend on t G [—1,1] only. 

The problem is essentially now to find the optimal trans- 
mittivity t e [—1,1], phase p G [0, 27r), state vectors \ri) 
and \ujo), ...,\ujn) for the optimal n e N, and the optimal 
xi, Xn+i in order to bound the optimal success probabil- 
ity. This is as such a very involved problem: The number n 
cannot be bounded from above, meaning we cannot a priori 
bound the required resources in the network. This makes it 
formally an infinite-dimensional problem. The function we 
consider is not convex, so we may and are expected to en- 
counter infinitely many local maxima. So even numerically, 
without truncating the problem cannot be solved as such. In 



order to circumvene these difficulties, two central ideas will 
be employed: We treat part of the objective variables as pa- 
rameters in the problem, such that the remaining problem can 
be relaxed to a convex quadratic program. In this way we can 
exploit methods from convex optimization. For this resul ting 
problem we make use of the ideas of Lagrange duahty 1151 
and outer approximations, and are able to explicitly construct 
a family of solution to the dual. Let us first clearly state the 
strategy: 

(I) We consider the problem for each t £ [—1,1], p £ 
[0, 2n), each n € N, and all legitimate ei, £n+i as defined 
above. This choice will be denoted as {t, p, n, ei, £n+i)- 
(II) We formulate the remaining problem of finding upper 
bounds to the optimal success probabilities as a quadratic op- 
timization program, which can be relaxed to a semi-definite 
program iTsIl in xi, Xn+i- (HI) Then, we are in the posi- 
tion to establish the dual problem. (IV) A family of explicit 
constructions of solutions of the dual will be presented. (V) 
These solutions can be simplified such that the dependence on 
the specific choice of ei, and p and t can be elimi- 

nated. This will be done exploiting two key ideas: on the one 
hand, by using both families of solutions of the dual problem, 
dependent on ei, £„+i and ip, on the other hand by appro- 
priate convex outer approximations. These powerful methods 
will allow us to identify rigorous general upper bounds for all 
numbers of auxiliary modes, even though the original prob- 
lem is unbounded in size. In a sense, we approach the optimal 
solution 'from the other side'. 

(I) The first simplification is that we may choose any 
£1, £„+i for some n satisfying J^kt^ii'^k + 1^1) ~ 1' 
noting with ak, S R the real and imaginary parts of Sk, 
£fe = ttfe + i/3fc. Again for simplicity of notation, we intro- 
duce 

^i{iv-<t>j) _ ^(j) _|_ 
with ^(j), c^O) G R. Success of the gate requires that 



n+1 



n+1 



k=l k=l 
n+1 

Y^xkm^'^ +o.k&^)9'il, = 0, 

k=l 

for j,l = 0, n. Note that this is already a major simplifi- 
cation: instead of maximizing the actual trace of the state, we 
set the imaginary part to zero and avoid a very involved ad- 
ditional quadratic constraint at this point, without losing gen- 
erality. The square of the quantities of the first line in the 
previous equation is then the success probability. 

(II) We have to optimize for all {t, (p,n,ei, £n+i) over 

all weights xi, Xn+i satisfying Y!ik=i ^k ~ ~ ^■ 

This freedom corresponds to the weights in the preparation of 
the initial state of the auxiliary modes. The fact that we cannot 
restrict the size of the linear optics network is here reflected 
by the fact that we have to optimize over all possible weights 
corresponding to different preparations, even over all n. In 
this form, however, we will see that the problem is handable. 
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The constraint x^x = 1 can be relaxed to x^x < 1, which is 
a convex quadratic constraint that can also be written as 



1 x' 

X ln+l,n+l 



> 0. 



So in general, the problem of assessing a bound for the opti- 
mal success probability can be reduced to the following max- 
imization problem in the vector x = (.ti, Xn+i), reflecting 
the maximization of the success probability. The maximiza- 
tion problem in this vector (but not the full problem) is found 
to be manifestly of the form of a so-called semi-definite opti- 
mization problem fl^ . After a number of elementary steps, 
the maximization problem in this vector can be cast into the 
following convenient form of a maximization problem in the 
real symmetric matrix Z e ]R("+^) ^ '^"+^' as 

maximize — tr[FoZ], (2) 
subject to tr[ea.aZ] = 1, a=l,...,n + 3, 

tr[(ea,h + eb_a)Z] =0, a, 5 = 3, ...,n + 3, a^b, 

tT[ei^aZ] = tr[ea,i^] = 0, a 2, n + 3, 

tr[F,Z]=0, J = l,...,2N + 2. 

Z >0. 

The square of the solution is an upper bound for the success 
probabihty. Here, Fq = diag(l, 0, 0), and 



(III) We can now formulate the dual problem to this op- 
timization problem delivering the bounds, as a solution can 
explicitly be constructed IITIi . It can be shown that the dual 
problem can be written as follows, which is now a minimiza- 
tion problem in the objective vectors z G R"+^, v G R^^, 
and the matrix V e ]r("+3)x(«+3) 



Oil ® 



(cO))T„(c(0))T 
Ori+l,ri + l 



j = 1, N, correspond to the matrices that ensure the proper 
realization of the gate on the level of the real part, and 



Fj+N+i — Oi,i ® 







0)1+1, n+l 



j = 0, N, to the complex part, with Ofc,; denoting the k x / 
matrix all elements of which are zero. The matrix 



F, 



2N+2 





-c(")/2 



-(c(0))^/2 

Ori+l,Ti+l 



finally links the contraints in the primal problem. Here, the 
abbreviations 



are used for j ^ 0,...,N. The matrix ea,b S r("+3)><("+3) 
denotes the matrix all entries of which are zero, except of an 
entry 1 at (a, 5). The latter matrix F2N+2 can be replaced by 



G = li 



-7c(o)/2 
-7(c("))^/2 0„+i,„+i 



with7 S [1, 00) to be fixed later, such that p^ax corresponds to 
the square of the optimal objective value of Eq. (|2ji for 7=1, 
and is smaller for 7 > 1. This seemingly irrelevant modifica- 
tion will turn out to be a helful idea later on, to eliminate the 
dependence on the phase (p. 



27V+1 



minimize q z, 
subject to Fo + diag(0, zi, z„+2 

a=l 

+ V + V2N+2G > 0, 

where q = (1, 1), and matrix V has to be of the form V = 
O2.2 © W, with W e ]r("+i)x("+i) being a real symmetric 
matrix satisfying Wa.a = for all a = 1, n+ 1. In general, 
every solution of a dual problem to a semi-definite problem 
gives a bound to the optimal solution to the primal problem, 
as is not difficult to see fT^l . Once we are able to construct a 
solution z of the dual for all values of {t, (p,n,ei, e„+i), 
we arrive at a rigorous upper bound for the primal problem. 
As such. 



Pn 



gives an upper general bound of the desired success probabil- 
ity. 

(IV) We will now explicitly construct a family of solutions, 
dependent on a single number (5 e R. The presented solutions 
may look like unlikely objects, yet, they will deliver the de- 
sired bounds. To construct the family of solutions for the dual 
amounts to finding appropriate values for a matrix of arbi- 
trary dimension. The intuition behind the construction draws 
from two essential observations: on the one hand, the prob- 
lems in X and (ei, e„) are two intertwined quadratic prob- 
lems. So the solutions can be constructed such that the depen- 
dence from (ei, is entirely cancelled. The other cen- 
tral idea, dealing with the very involved constraints provided 
by polynomials of arbitrary order in i, is to get bounds by ap- 
propriately constructing provable outer approximations. This 
structure of the problem we encounter here is not only spe- 
cific for the optimization problem at hand, but expected to be 
a generic feature in problems related to linear optics; roughly 
speaking, the intertwined quadratic problems originate from 
the auxiliary systems, whereas the polynomial constraints of 
high order from the distinguished passive optical element. 

In the construction, to start with, we choose i'27v+2 = 1- 
Let for convenience w G R"+^ be defined as 

N N N 

w ^ (-7/2 - J2 ^^O*^^"^ + E '"^'-^'^ + E ^N+,+id^'\ 

i=i 3=1 j=o 

We are free to choose 



Vj = — cos{jip)i 
VN+j+i = sin(jV)sj, 



j = 1,...,N, with functions Sj : [—1,1] R"*" yet to be 
specified. This freedom will later give rise to the outer ap- 
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proximation. Then, let us set 

N 

7 = 2^ Sj(l - coa{ipj)) + 1. 

This means that always 7 > 1, which is used to eliminate the 
unwanted dependence of (p. That is, 

N N 

Wk/ak = (-1/2 + Sj)gf^ - ^ cos{4)j)sjg^^\ 
j=i i=i 

The matrix W e r("+i)x("+i) is taken to be of the form 

„, / WaWb, if 6 ^ a, 
^''^"=10, if6 = a. 

This construction yields a positive V fl9ll . Finally, we choose 

za = al_i5 

for a = 2, n + 2 and zi = S. With this choice, indeed 
Fq + diag(0, zi, Zn+2 

holds, so it is in fact a solution of the dual ifislfigil . This choice 
will indeed turn out to give the appropriate upper bounds. 

(V) If we can now find functions si , . . . , sat : [—1,1] — > 
such that there is a (5 G [0, 1] with 

\wk/ak\ < S (3) 

for all k — 0, n, we can in fact eliminate the dependence 
on ai, an+i and t, as we have then an outer approximation 
of the feasible set. The outer approximation defined by Eq. 
Q takes care of the polynomial constraints in t € [0, 1] of 
arbitrary order, constraints of a type that one would encounter 
in any optimization involving passive optical elements. 

We have then indeed established an upper bound: The 
above constructed solution yields 

n+l 

1=1 

so Pmax < 4(5^ is a rigorous upper bound for the success 
probability. So finding an upper bound for the success prob- 
ability amounts to finding solutions, possibly dependent on 
t e [—1, 1], for si, sn such that Eq. Q is satisfied. This 
provides a general method that can be applied to all of the 
above considered gates. It is important to note that although 
we had the freedom to construct this particular solution with- 
out caring whether this solution is unique or even optimal, this 
implies a rigorous bound for the primal problem, and therefore 
for the optimal success probability. This gives rise to a recipe 
for finding upper bounds for success probabilities for all the 
above quantum gates using linear optics. 

The example of the non-linear sign shift is on the one hand 
instructive to exemplify the general strategy, and on the other 



hand already the practically most important case. Here we 
have that N = 2 and (po = 1, (pi = 1, and (p2 = tt. For this 
case of = 2, one finds .g[°^ = t'', g[^'^ = t''-^{t^ - k{l - 

e)), and gf '^ = t''-^ {t^ - 2ke {\-e) + {\-e fk{k-l) /2) 
using standard expressions for the unitaries of beam splitters 
in the number state basis. We now have to show that for each 
t € [—1, 1] we can find si, S2 : [—1,1] such that Eq. 

(|3} is satisfied. More specifically, for alH e [— 1 , 1] we have 
to have find si , S2 such that 

-1/4 < (-1/2 + ,si + 52)5^ - siffi'^ + S25f < 1/4 

for all /s ~ 0, 00, so we have that h = 1/4. Such a choice 
is given by 

^ r (1/(1-0,0), ifte [-1,1-^/2), 

(si,S2) = -<^ (0,l/(l + t2)), ifte [1-V2,0), (4) 

* [ (1,1/2), ift e [0,1), 

for all k = 0, 00 I20I1 . This can be shown with elementary 
methods, on the basis of only the functions in Eq. such 
that Eq. (|3} holds for (5=1/4 for all k. This finally demon- 
strates that the optimal success probability of a linear optical 
implementation of the non-linear sign shift gate without feed- 
forward is indeed 1/4: there are known schemes that fulfill 
this bound. This settles the question of the optimal success 
probability of this key quantum gate in this setting. This state- 
ment is interestingly completely independent of the network 
size, as long as it includes at least two auxiliary modes. The 
surprising result is that more resources do not help at all, and 
the smallest known functioning scheme can already be proven 
to be optimal. This unexpected outcome may also be taken 
as a further motivation to further investigate hybrid solutions, 
slightly leaving the setting of linear optics 1I21I1 . 

The presented method can immediately be applied to assess 
optimal success probabilities of other quantum gates within 
the paradigm of linear optics. In order to exemplify the ver- 
satility of the approach, let us finally investigate two further 
quantum gates: this is on the one hand the non-linear phase 
shift gate, acting as 

2/0IO) + yijl) + y2l2) ^ yolO) +Vi\l)+ e'^^^j/2l2) 

with some phase (/12 G [0, 27r). Here, the presented method 
delivers immediately 

Pmax < (3 - C0s(7r - (t)2)f/l&, 

consistent with pmax = 1 for (/'2 = and pmax = 1/4 for 
(f)2 = tt: it hence depends on the phase how difficult it is to 
implement the gate. On the other hand, for the three photon 
gate 

2/0IO) +2/1II) +2/212) +2/313) ^yol0)+yill)+y2l2)-y3l3) 

we find that p^ax < 1/9. This indicates that for higher Fock 
layers, the optimal success probabilies becomes even smaller 
The implications for a number of further gates including the 
CNOT will be presented in a forthcoming publication. The 
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key point is that this method allows one to argue without hav- 
ing to restrict the amount of allowed resources or the size of 
the specific network realizing a scheme. Moreover, a finite 
number of rounds of measurements and feed-forward can in 
principle be incorporated in such a setting. Statements on the 
distinguishability using auxiliary systems I22I1 are also acces- 
sible. As such, these ideas are hoped to be useful to contribute 
to finding linear optical schemes that make use of the minimal 



resources, and to bringing linear optics quantum computation 
closer to feasibility. 

Discussions with W.J. Munro, S. Scheel, P. van Loock, C. 
Emery, J.D. Franson, K. Nemoto, P. Kok, M.A. Nielsen, N. 
Liitkenhaus, and M. Knill are warmly acknowledged. This 
work was supported by the DFG (SPP 1078), the EU (IST- 
2001-38877), and the EURYI Award Scheme. 
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